Apparatus and method of embedding meta-data in a captured image

ABSTRACT

Methods and apparatuses generate and render data embedded with associated meta-data in a human or machine recognizable format. The apparatus includes an obtaining device to acquire first data in a predetermined format and associated second data comprising information of the first data, and to output the first data and the associated second data; a processing device to receive the first data and the associated second data from the obtaining device, to process the first data and the associated second data to thereby generate meta-data based on the first data and/or the associated second data, to convert the meta-data into the predetermined format of the first data, and to embed the converted meta-data into the first data as a combined data in the predetermined format; and a rendering device to receive the combined data from the processing device, and to render the combined data in the human or machine recognizable format.

BACKGROUND OF THE INVENTION

1. Field of the Invention

Aspects of the present invention relate to a method of embedding meta-data to an image, and an apparatus to embed the meta-data to the image. More specifically, aspects of the present invention relate to a method of embedding an associated meta-data of an image into the image, and an apparatus to embed the associated meta-data into the image so that the associated meta-data and the image may be visually perceived by a human user or programmatically perceived by a device, or both.

2. Description of the Related Art

Various devices and peripherals allow a user to capture an image of various locations or items in the form of image files (such as Joint Photographic Experts Group (JPEG)) or video (such as Moving Picture Experts Group (MPEG4)), or capture sounds in the form of audio files (such as MP3). The various devices and peripherals may further collect additional information for the captured image, video, or sounds contemporaneously. Examples of such additional information include time, geographical location, temperature, identification, and other properties of interest to the user.

SUMMARY OF THE INVENTION

To effectively use additional information that are collected with captured images, videos, or sounds, the additional information should be transformed into a usable format, and embedded into a file of the captured images, videos, or sounds in the usable format. Specifically, the embedded additional information is converted into meta-data having the same format as the captured images, videos, or sounds, and then embedded into the captured images, videos, or sounds. Once the additional information is transformed and imbedded, the imbedded additional information and the file may be rendered and perceived by a human user, a device, or both.

According to an aspect of the present invention, an apparatus to generate and render data embedded with associated meta-data in a human or machine recognizable format, includes an obtaining device to acquire first data in a predetermined format and associated second data comprising information of the first data, and to output the first data and the associated second data; a processing device to receive the first data and the associated second data from the obtaining device, to process the first data and the associated second data to thereby generate meta-data based on the first data and/or the associated second data, to convert the meta-data into the predetermined format of the first data, and to embed the converted meta-data into the first data as a combined data in the predetermined format; and a rendering device to receive the combined data from the processing device, and to render the combined data in the human or machine recognizable format.

According to an aspect of the present invention, an apparatus to generate and render an image embedded with associated meta-data, includes a data acquisition driver to acquire an image, and associated data comprising information of the image; a data processor to process the acquired image and the associated data from the data acquisition driver in order to generate meta-data based on the image, the associated data, or both, to convert the meta-data into a same format as that of the image, and to embed the converted meta-data into the image; and a rendering driver to receive the image embedded with the converted meta-data from the data processor, and to render the image with the embedded converted meta-data as an output image.

According to an aspect of the present invention, a method of generating and rendering data embedded with associated meta-data in a human or machine recognizable format, includes obtaining first data in a predetermined format, and second data that is associated with the first data; processing the obtained first data and the associated second data to generate meta-data based on the first data, and/or the associated second data, converting the meta-data into the predetermined format of the first data; embedding the converted meta-data into the first data to obtain a combined data; and rendering the combined data in the predetermined format of the first data, the predetermined format of the first data being the human or machine recognizable format.

According to an aspect of the present invention, a method of embedding meta-data in an image to be rendered together, includes obtaining the image; obtaining associated data contemporaneously with the image, the associate data being additional information of the image and includes a user input data, environmental data, and/or collected data that is associated with the image; extracting meta-data from the image that characterizes the image according to user selection as an extracted meta-data; generating a consolidated meta-data by consolidating the associated data and the extracted meta-data; generating a meta-data sub-image, which is a visual representation of the consolidated meta-data, by converting the consolidated meta-data into the same format as the image; embedding the meta-data sub-image into the image to generate a combined image in the same format as the image; and visually rendering the combined image on a medium or a display device.

According to an aspect of the present invention, a visually rendered combined image formed a medium, the combined image comprising: a photographic image in a predetermined format; and an embedded meta-data sub-image that is positioned in a predetermined position of the photographic image, the embedded meta-data sub-image being an image representation of information of the photographic image, and being in the same predetermined format as the photographic image.

Additional aspects and/or advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

These and/or other aspects and advantages of the invention will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:

FIG. 1 illustrates an input device to obtain an image and associated information according to an aspect of the present invention;

FIG. 2 illustrates a schematic of an apparatus 200 to embed meta-data to an obtained data according to an aspect of the present invention;

FIG. 3 illustrates a schematic of an apparatus to embed meta-data in an image according to an aspect of the present invention;

FIG. 4 illustrates a method of embedding meta-data to an obtained data according to an aspect of the present invention;

FIG. 5 illustrates a method of embedding meta-data in an image according to an aspect of the present invention; and

FIG. 6A illustrates an image with an embedded meta-data, referred to as a combined image, and 6B illustrates a standalone meta-data representation, which may be affixed to a separately rendered image, according to aspects of the present invention.

DETAILED DESCRIPTION OF THE EMBODIMENTS

Reference will now be made in detail to aspects of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The aspects are described below in order to explain the present invention by referring to the figures.

A method is herein conceived to be a sequence of steps or actions leading to a desired result and may be implemented as software. While it may be convenient to discuss such software as if embodied by a single program, most implementations will distribute the described functions among discrete (and some not so discrete) pieces of software. These pieces are often described using such terms of art as “programs,” “objects,” “functions,” “subroutines,” “libraries,” “.dlls,” “APIs,” and “procedures.” While one or more of these terms may be described in aspects of the present invention, there is no intention to limit the scope of the claims.

With respect to the software described herein, those of ordinary skill in the art will recognize that there exist a variety of platforms and languages for creating software for performing the methods outlined herein. Aspects of the present invention can be implemented using MICROSOFT VISUAL STUDIO or any number of varieties of C. However, those of ordinary skill in the art also recognize that the choice of the exact platform and language is often dictated by the specifics of the actual system constructed, such that what may work for one type of system may not be efficient on another system. It should also be understood that the methods described herein are not limited to being executed as software on a microprocessor, but may be executed using other circuits. For example, the methods could be implemented on a digital signal processor, a FPGA, or with HDL (Hardware Design Language) in an ASIC.

FIG. 1 illustrates an input device to obtain an image and associated information according to an aspect of the present invention. As shown in FIG. 1, illustrated is a type of data collection device referred to as a portable data terminal (PDT) as an example of the input device. A PDT generally integrates a mobile computer, one or more data transport paths and one or more data collection subsystems. The mobile computer portion is generally similar to a typical touch screen consumer oriented portable computing devices (e.g. “Pocket PCs” or “PDAs”), such as those available from PALM®, HEWLETT PACKARD®, and DELL®. The data transport paths include wired and wireless paths, such as 802.11, IrDA, BLUETOOTH, RS-232, USB, CDMA, GSM (incl. GRPS), and so forth. The data collection subsystem generally comprises a device that captures data from an external source, for example, touches, keystrokes, RFID signals, images, and bar codes. The PDT is distinguished from typical consumer oriented portable computing devices through the use of “industrial” components integrated into a housing that provide increased durability, ergonomics, and environmental independence over the typical consumer oriented devices. Additionally, the PDT tends to provide improved battery life by utilizing superior batteries and power management systems. Referring back to FIG. 1, the PDT 100 utilizes an elongated body 102 supporting a variety of components, including: a battery (not illustrated); a touch screen 106 (generally comprising an LCD screen under a touch sensitive panel); a keypad 108 (including a scan button 108 a); a scan engine (not illustrated); and a data/charging port (also not illustrated). The scan engine may comprise, for example, one or more of an image engine, a laser engine, or an RFID engine. The scan engine is generally located near a top end 110 of the PDT 100 and is used to scan markings such as product codes. The data/charging port typically comprises a proprietary mechanical interface with one set of pins or pads for transmitting and receiving data (typically via a serial interface standard such as USB or RS-232) and a second set of pins or pads for receiving power for operating the system and/or charging the battery. The data charging port is generally located near a bottom end 111 of the PDT 100.

In use, the user presses the scan key 108 a to initiate data capture via the scan engine. The captured data is analyzed, e.g. decoded to identify the information represented, stored and, displayed on the touch screen 106. Additional processing of the data may take place on the PDT 100 and/or an external data processing resource to which the data is transmitted.

In other aspects of the present invention, the scan key 108 a or another key may be used to initiate image capture via the image engine for further processing. In such a case, the image engine may be used to obtain an image of a subject, such as merchandise. Additionally, the PDT 100 may have a microphone to capture sounds, sensors to measure temperature or other environmental information, a GPS system to obtain position information of a location, a wireless connection to connect to a network or the internet, for example.

FIG. 2 illustrates a schematic of an apparatus to embed meta-data to an obtained data according to an aspect of the present invention. As shown in FIG. 2, the apparatus 200 includes an obtaining device 210, a processing device 220, and a rendering device 230.

Specifically, the obtaining device 210 acquires, or receives input of first data, and second data that is associated or corresponds to the first data, and outputs the first data and the associated second data to the processing device 220. In turn, the processing device 220 processes the acquired or received first data and the associated second data, generates meta-data based on the first data, the associated second data, or both, converts the meta-data into a format of the first data (referred to as a sub meta-data) or another format (referred to as a block meta-data), and either embeds the meta-data into the first data as a combined data or provides the block meta-data for later rendering as a standalone meta-data.

The processing device 220 then provides the combined data and the block meta-data to the rendering device 230 for rendering. The rendering device 230 receives the combined data and/or the block meta-data from the processing device 220, and outputs the combined data in a predetermined format, and/or renders the block meta-data as standalone meta-data in the same or different format as the predetermined format. In aspects of the present invention, the first and second data may be an image, video, audio, sound, text, music, or in other human user or device perceivable formats.

FIG. 3 illustrates a schematic of an apparatus to embed meta-data in an image according to an aspect of the present invention. As shown in FIG. 3, the apparatus 1000 includes a data acquisition driver 1100, a data processor 1200, and a data rendering driver 1300.

In aspects of the present invention, the data acquisition driver 1100 acquires, or receives input of, image data, and other data corresponding to the image data (referred to as an associated data), and outputs the image data and the associated data to the data processor 1200. In turn, the data processor 1200 processes the acquired or received image data and the associated data, generates meta-data based on the image data, the associated data, or both, converts the meta-data into a visual format (referred to as a sub-visual meta-data) or a data block format (referred to as a block meta-data), and either embeds the sub-visual meta-data into the image data or provides the block meta-data for later rendering. Additionally, the rendering driver 1300 receives the image data that is embedded with the sub-visual meta-data and/or the block meta-data from the data processor 1200, and outputs the image data that is embedded with the sub-visual meta-data as an output image, or renders the block meta-data as a visual standalone meta-data, or both.

Referring to FIG. 3 in greater detail, the data acquisition driver 1100 includes a visual image capture driver 1110 and a meta-data capture driver 1120. In aspects of the present invention, the visual image capture driver 1110 may be a device, software (SW), or firmware (FW) of the apparatus 1000, and may simply obtain an image from a visual capture device 1130, or may trigger the visual capture device 1130 to capture the image, and output the obtained or captured image to the visual image capture driver 1110. The visual image capture driver 1110 may also process the obtained or captured image, which may be in a predetermined data format, and convert the image into another predetermined data format or may preprocess the image for later processing in the data processor 1200. The visual image capture driver 1110 then outputs the image in the predetermined data format, or which has been preprocessed, to the data processor 1200.

In aspects of the present invention, the visual capture device 1130 may be any device capable of capturing or obtaining an image. For example, the visual capture device 1130 may be a built-in and/or externally connected video capture device, which is used to take a picture, a video, or a sequence of pictures of an item or a location, such as inventory or a warehouse. Other examples of the visual capture device 1130 include a digital camera, a scanner, a camcorder, a cellphone having a camera function, a portable data terminal (PDT), and a webcam, for example, and may also encompass built-in optical imager device, and an external video camera connected over BLUETOOTH. The image obtained by the visual capture device 1130 maybe in any image format, including JPEG (Joint Photographic Experts Group), PDF (Portable Document Format), TIFF (Tagged Image File Format), or MPEG (Moving Picture Experts Group), for example.

While the visual image capture driver 1110 obtains an image from the visual capture device 1130, the meta-data capture driver 1120 obtains a user input, environmental data, and/or collected data to associate with the obtained image, from a user input device 1140, one or more environment sensors, and/or a smart meta-data collection agent. Instead of simply obtaining the user input, the environmental data, and/or the collected data, the meta-data capture driver 1120 may trigger the user input device 1140 to obtain and output the user input, the one or more environment sensors 1150 to obtain and output the environmental data, and the smart meta-data collection agent 1160 to obtain and output the collected data. Each of the user input device 1140, the one or more environment sensors 1150, and the smart meta-data collection agent 1160 outputs the respective user input, the environmental data, and/or the collected data to the data processor 1200, and especially, the meta-data capture driver 1120.

In aspects of the present invention, the user input device 1140 may be any device capable of obtaining an input from a user. For example, the user input device 1140 may be a built-in and/or externally connected user-system interaction device, which is used to obtain preferences and various data input from the user. Examples of the user input device 1140 include a keyboard, a key pad, a mouse, a touch screen, a touch pad, a scanner, and a trackball, for example. Additionally, the user input device 1140 may include an optical imager device to scan a 2D barcode representation or optical character recognition (OCR)-recognizable text presented by the user on plain paper. In aspects of the present invention, the user input device 1140 may be wired or wireless devices. Wireless devices may be BLUETOOTH devices, for example.

In aspects of the present invention, the one or more environment sensors 1150 may be any device capable of obtaining data from the environment that is associated with the image. For example, the one or more environment sensors 1150 may be built-in and/or externally connected sensors, which are used to gather complimentary information from the environment in which the image was taken. Examples of the one or more environment sensors 1150 include thermometers or other meteorological sensors, a body-temperature and blood pressure meter, a GPS locator, an electronic compass, a movement sensor, a speech recognizer, a radio frequency identification (RFID) sensor, a facial biometry reader, a fingerprint reader, an electronic key reader, and a timepiece, for example.

In further illustrating the different types and use of the above environment sensors 1150, it should be understood that the following examples are non-limiting. For example, environment sensors 1150 may be thermometers or other meteorological sensors if the associated image is of a cloud; may be a body-temperature and blood pressure meter if the associated image is of a patient in a hospital, a GPS locator if the associated image is of a historic building, and so on. In aspects of the present invention, the one or more environment sensors 1150 may be wired or wireless devices. Wireless devices may be BLUETOOTH devices, for example.

In aspects of the present invention, the smart meta-data collection agent 1160 may be any device capable of obtaining data of, or for use with, the image. For example, the smart meta-data collection agent 1160 may be used to perform a database query, run internet searches, and/or mine databases for data relating to the image. Accordingly, the smart meta-data collection agent 1160 may be an on-device database query agent, a remotely running internet search agent connected over TCP/IC, or a remote database mining agent connected over Global System for Mobile communications/General Packet Radio Service (GSM/GPRS) connection, in aspects of the present invention, though not limited thereto.

In various aspects, the user input device 1140, the one or more environment sensors 1150, and the smart meta-data collection agent 1160 may be devices, such as hardware, or may be software (SW) or firmware (FW) that runs on a processor or a dedicated device. The user input, the environmental data, and/or the collected data respectively output from the user input device 1140, the one or more environment sensors 1150, and the smart meta-data collection agent 1160 are collected in the meta-data capture driver 1120. The meta-data capture driver 1120 processes the user input, the environmental data, and/or the collected data, and generates an associated data from the user input, the environmental data, the collected data, and/or portions thereof. The associated data may already be a meta-data in a predetermined format. The associated data is then output to the data processor 1200.

In aspects of the present invention, the associated data corresponds to the obtained or captured image from the visual capture device 1130, and provides additional information about the image. For example, the associated data may be jargon term for an item that is input by the user if the image is of the item that is part of an inventory, or may be descriptive information of a location that is input by the user if the image is of that location, such as, a warehouse. On the other hand, the associated data may be the environmental data, such as temperature and/or humidity, obtained by the one or more environment sensors 1150 if the image is of a warehouse containing certain inventory, such as, ice cream. Additionally, the associated data may be the collected data, such as recall information, obtained by the data collection agent 1160 if the image is of merchandise that has been found defective.

Referring back to FIG. 3, the data processor 1200 processes the various acquired or received data (including the image and the associated data) from the data acquisition driver 1100, namely, from the visual image capture driver 1110 and the meta-data capture driver 1120. The data processor 1200 uses the acquired or received data to generate meta-data that is associated with the image, and embeds the meta-data to the image or provides the meta-data for later rendering. The data processor 1200 includes a visual meta-data extractor 1210, a meta-data processor 1220, a smart meta-data miner 1230, a raw-image merger 1240, a meta-data to raw-image formatter 1250, and a meta-data to data-block formatter 1260, for example.

In aspects of the present invention, the visual meta-data extractor 1210 may be a device, software (SW), or firmware (FW), and receives input of the obtained or captured image (o referred to as simply an image) in a predetermined data format or which has been preprocessed, analyzes the image, and extracts one or more meta-data that characterizes the image. Further, the visual meta-data extractor 1210 obtains additional meta-data from, or provides the meta-data to, one or more other components of the data processor 1200, such as the meta-data processor 1220. Additionally, the visual meta-data extractor 1210 outputs, to the raw-image merger 1240, the image either as is (referred to as a raw image), or enhanced by highlighting one or more characteristics in the image that were extracted and turned into meta-data (referred to as an enhanced image) according to a user preference.

In aspects of the present invention, the visual meta-data extractor 1210 may be a logical component implemented in software (SW) and/or firmware (FW), such as a field programmable gate array (FPGA) or a complex programmable logic device (CPLD), and is used to analyze the image in the predetermined data format or which has been preprocessed, and extracts (or generates) meta-data from the image. For example, the extracted meta-data may include information as to face recognition, biometry data extraction, OCR recognition, object recognition, and position/distance measurement, for example, from the image. Once the meta-data is extracted, the image may be output in an enhanced format to indicate that meta-data has been extracted from the image. For example, a face in the image may be highlighted. In aspects of the present invention, the preference for which characteristics of the image to extract as meta-data may be based on user preference.

In aspects of the present invention, the meta-data processor 1220 may be a device, software (SW), or firmware (FW), and receives the associated data (which may be meta-data in a predetermined format) from the meta-data capture driver 1120, the image from the visual meta-data extractor 1210, and/or the extracted meta-data from the visual meta-data extractor 1210. The meta-data processor 1220 analyzes the received associated data, the extracted meta-data, and/or the image, extracts additional meta-data from the associated data and/or the image, consolidates or generates a consolidated meta-data from the extracted meta-data and the additional meta-data that characterizes the image and information related to the image data, respectively, and outputs the consolidated meta-data to the meta-data to RAW image formatter 1250 and/or the meta-data to data-block formatter 1260.

In addition to the meta-data processor 1220 obtaining the extracted meta-data, or the image as is or in an enhanced format, from visual meta-data extractor 1210, the meta-data processor 1220 may provide one or more meta-data that have been consolidated or generated to the visual meta-data extractor 1210. Additionally, the meta-data processor 1220 may obtain supplementary meta-data from the smart meta-data miner 1230 so that the meta-data processor 1220 may generate or consolidate the consolidated meta-data by also using the supplementary meta-data or portions thereof. Further, the meta-data processor 1220 may provide the consolidated meta-data to the smart meta-data miner 1230. The supplementary meta-data is additional data that is obtained from an outside source, and may be obtained based on the associated data, the extracted meta-data, and/or the image.

In aspects of the present invention, the consolidated meta-data need not be generated or consolidated from the entire associated data, the extracted meta-data, the image, and/or supplementary meta-data by the meta-data processor 1220. Rather, the consolidated meta-data may be consolidated or generated from one or more portions of the associated data, the extracted meta-data, the image, and/or the supplementary meta-data, respectively. Examples of one or more portions of the associated data, the extracted meta-data, the image, and/or the supplementary meta-data may include information that relate to the image, such as, time the image was taken, geographical position the image was taken, types of objects depicted in the image, properties of the subjects (or depicted items) of the image, temperatures of the subject or the environment at the time when the image was taken, and/or skin tone, if the image is of a person.

The output of the meta-data processor 1220 is consolidated meta-data that is generated or consolidated from all the associated data, the extracted meta-data, the image, the supplementary meta-data, or portions thereof that were obtained from the meta-data capture driver 1120, the visual meta-data extractor 1210, and the smart meta-data miner 1230. The consolidated meta-data may be generated or consolidated also by combining or synthesizing the associated data, the extracted meta-data, the image, and/or the supplementary meta-data. Then, the consolidated meta-data is output from the meta-data processor 1220 to the meta-data to RAW image formatter 1250 and/or the meta-data to data-block formatter 1260.

In aspects of the present invention, the smart meta-data miner 1230 may be logical components implemented in software (SW), a device, or firmware (FW), and connects to remote information sources, such as data bases, search engines, and/or search agents that are accessible via a network and/or the internet. The smart meta-data miner 1230 searches and retrieves the supplementary meta-data that is additional to a base meta-data provided or queried by the meta-data processor 1220. As discussed above, the smart meta-data miner 1230 provides the retrieved supplementary meta-data to the meta-data processor 1220.

In aspects of the present invention, the meta-data to RAW image formatter 1250 is an image processing component implemented in software (SW) and/or firmware (FW), and projects or converts the consolidated meta-data onto a visual representation according to user preference, such as a 2D barcode. Hereinafter, the visual representation of the consolidated meta-data is referred to as a meta-data sub-image. The meta-data to RAW image formatter 1250 outputs the meta-data sub-image to the RAW-image merger 1240.

In aspects of the present invention, the RAW-image merger 1240 is an image processing component implemented in software (SW) and/or firmware (FW), and merges the raw image or the enhanced image from the visual meta-data extractor 1210 with the meta-data sub-image from the meta-data to RAW-image formatter 1250, into one overall merged image according to user preferences. In other aspects, the RAW-image merger 1240 embeds the meta-data sub-image into the raw image or the enhanced image to generate an embedded image. In aspects of the present invention, the merged image and embedded image may be referred to as simply a combined image. Accordingly, the combined image is output from the data processor 1200 to the data rendering driver 1300. Particularly, the combined image from the RAW-image merger 1240 is output to the image render 1310 in the data rendering driver 1300. In aspects of the present invention, the meta-data to data-block formatter 1260 is a data processing component implemented in software (SW) and/or firmware (FW), and formats the consolidated meta-data into a standalone user or machine recognizable representation of the consolidated meta-data according to user preferences. Once formatted, the meta-data to data-block formatter 1260 outputs the formatted consolidated meta-data representation to the data rendering driver 1300. Particularly, the formatted consolidated meta-data representation is output from the meta-data to data-block formatter 1260 to the data render 1320 of the data rendering driver 1300. Hereinafter, the formatted consolidated meta-data representation will be referred to as a standalone meta-data representation.

It should be understood that in some aspects of the present invention, the RAW-image merger 1240 receives the raw image or the enhanced image from the visual meta-data extractor 1210, and simply outputs the raw or the enhanced image to the image render 1310, without creating a combined image, according to user preference. If the raw or the enhanced image is simply output, the standalone meta-data representation from the data renderer 1320 may be output to be physically attached or adhered to the raw or the enhanced image that is rendered by the image renderer 1310.

Referring back to FIG. 3, the data rendering driver 1300 receives the combined image and/or the standalone meta-data representation from the data processor 1200, and outputs a primary output 1330 that is a visually rendered combined image and/or a secondary output 1340 that is a rendered standalone meta-data representation. The data rendering driver 1300 includes the image renderer 1310 and the data renderer 1320. The image renderer 1310 receives the combined image and outputs the rendered combined image, and the data renderer 1320 receives the standalone meta-data representation and outputs the rendered standalone meta-data representation. Once output, one or both of the primary output 1330 and the secondary output 1340 may be saved or stored in an electronic form or other format in the apparatus 1000, and/or transferred to an outside device for later processing, rendering, or publishing. The storing, processing, rendering, and publishing of the primary output 1330 and the secondary output 1340 may include use of any storage medium or distribution network such that information systems or media broadcasting services are able to search for, process, and render primary output 1330 in the form of the combined image and/or the secondary output 1340 in the form of the standalone meta-data representation.

In aspects of the present invention, the image renderer 1310 is a component implemented in software (SW) and/or firmware (FW), and outputs a primary output 1330 that results from merging or embedding the meta-data sub-image into the raw image or the enhanced image, and then rendering the combined image. In aspects of the present invention, the primary output 1330 is a visually readable image or a picture, and having the meta-data sub-image that overlies a selected portion of the image or the picture according to user preference. In aspects of the present invention, the location of the meta-data sub-image in the image, when rendered, may be in a predetermined portion of the image. Examples of such locations include a particular corner of a rectangular image, or a border of the rectangular image that may be added or augmented to an existing or standard border. Additionally, the meta-data sub-image, when rendered with the image, may be a watermark that is semi-visible, semi-transparent, or translucent.

Additionally, in an aspect of the present invention, the meta-data sub-image may be rendered as a symbol, text, script, or barcode, where examples of such barcodes include linear barcodes, such as, universal product codes (UPC), or matrix or 2D barcodes, such as, an Aztec code. In aspects of the present invention, the usable type of barcode or other symbols to render the meta-data sub-image is not limited. Additionally, in aspects of the present invention, the meta-data sub-image may be rendered to be located in the upper-right corner of the image.

In aspects of the present invention, the combined image may be rendered or output by being printed on a medium. In aspects of the present, the medium may be paper, a plastic card, a label, for example. Accordingly, the combined image may be implemented as a photograph, a PDF file, an x-ray image, or an image that is displayed on display screen, for example. If the output is printed on a medium, a printer may be further utilized to print the combined image to the medium.

In aspects of the present invention, the data renderer 1320 is a component implemented in software (SW) and/or firmware (FW), and outputs a complementary output 1340 that is the standalone meta-data representation. The standalone meta-data representation is rendered in a selected structured format, which can be used to physically attach the rendered standalone the meta-data representation to a rendered raw image or the enhanced image. Additionally, the standalone meta-data representation may be used to electronically attach the standalone meta-data representation to the raw image or the enhanced image in their electronic formats. Additionally, the standalone meta-data representation may simply be stored electronically.

To elaborate, the physical attachment of the rendered standalone meta-data representation to the rendered raw image or the enhanced image may be exemplified by physically attaching a printed barcode to a separately printed image. However, the separate printing of the image related to the standalone meta-data representation is not required. For example, in aspects of the present invention, the standalone meta-data representation may be in a format, namely, a visual picture file, such as bitmap, that contains a 2D barcode to be printed as a label. The label can then be physically attached unto an item that was the subject of the image used to generate the 2D barcode. The item that is the subject of the image may be a mail parcel, and the label may be an address label for the mail parcel. Other items that may be the subject of the image may include a rental car, warehouse inventory items, or air passenger luggage.

On the other hand, the electronic attachment of the standalone meta-data representation to the raw image or the enhanced image may be exemplified by associating date/time information, shutter speed information, and/or light conditions information to a regular digital camera JPEG file. Finally, the electronically storing of the standalone meta-data representation may be exemplified by storage of the standalone meta-data representation in a computer database. If stored in such a manner, the standalone meta-data representation and information thereof can be published to be searchable by smart data search engines, information systems, and/or media broadcasting services.

As discussed above, the image renderer 1310 outputs a primary output 1330 that results from merging or embedding the meta-data sub-image into the raw image or the enhanced image, and which is the combined image, and the data renderer 1320 outputs a complementary output 1340 that is the standalone meta-data representation. In aspects of the present invention, one or both of the primary output 1330 in the form of the combined image and the secondary output 1340 in the form of the standalone metadata representation may be saved or stored in an electronic form or other format in the apparatus 1000, and/or transferred to an outside device or the renderer for later processing, rendering, or publishing. In aspects of the present invention, the transfer to the outside device or renderer to be processed, rendered, or published may be by way of a network or the internet, and the outside device or renderer may be a remote network server or a mobile device. Once stored or published through use of any storage medium or distribution network, the combined image and/or the standalone meta-data representation may be searchable or searched by information systems or media broadcasting services.

Additionally, the one or both of the primary output 1330 in the form of the combined image and the secondary output 1340 in the form of the standalone metadata representation may be output in the electronic form or the other format to a virtual medium, or may be broadcast. In aspects of the present invention, the virtual medium may be a networked server hard-drive, or a remote mobile device file-system. The primary and/or the secondary output to be broadcast may be forwarded to a public-internet or company-intranet web-page, or output as a broadcast video stream to be broadcast for viewing via a display or a television, or output as a web page RSS-feed to be distributed over a community of mobile devices, or any combinations thereof. Additional types of publishing or broadcasting formats may include JPEG and MPEG formats for displays, such as on LCD-TV screens, or BITMAP format for mobile devices, such as personal digital assistants (PDAs) or cell phones.

The above aspects of FIG. 3 discusses the apparatus 1000 in terms of images, so that the apparatus 1000 acquires image data and other data corresponding to the image data; generates a meta-data based on the image data and the corresponding other data; merges or embeds the meta-data into the image and/or providing a standalone meta-data; and renders a combined image and/or the standalone meta-data. However, it should be understood that the apparatus need not be limited to handling images, and the inputs for the apparatus 1000 can be non-visual inputs, such as, sound or machine readable data. In general, the inputs for the apparatus 1000 may simply be any data that can be merged or embedded with a corresponding meta-data.

Accordingly, if the input for the apparatus 1000 is music, the data acquisition driver 1100 acquires, or receives input of, music data and other data corresponding to the music data (referred to as the associated data), and outputs the music data and the associated data to the data processor 1200. In turn, the data processor 1200 processes the acquired or received music data and the associated data, generates meta-data based on the music data, the associated data, or both, converts the meta-data into an audible format (referred to as a sub-aural meta-data) or a data block format (referred to as a block meta-data), and either embeds the sub-aural meta-data into the music data or provides the block meta-data for later standalone rendering. Additionally, the rendering driver 1300 receives the music data embedded with the sub-aural meta-data, and the block meta-data from the data processor 1200; and outputs the music data embedded with the sub-aural meta-data as output sound, or renders the block meta-data as an aural standalone meta-data, or both. The embedded sub-aural meta-data may be reproduced before or after the music data, in aspects of the present invention.

FIG. 4 illustrates a method of embedding meta-data to an obtained data according to an aspect of the present invention. In aspects of the present invention, the method of FIG. 4 may be practiced using the apparatus 200 as shown in FIG. 2.

Referring to FIG. 4, first data, and second data that is associated with or corresponds to the first data, are acquired or received in operation S410. The acquired or received first data and the associated second data is processed to generate meta-data based on the first data, the associated second data, or both, in operation S420. The, the meta-data is converted into a format of the first data (referred to as a sub meta-data) or another format (referred to as a block meta-data), and the meta-data is embedded into the first data as a combined data. Accordingly, the first data and the meta-data are integrated into a combined data in operation S430. In other aspects of the present invention, the meta-data may also be provided additionally as a standalone meta-data for later rendering.

Once the first data and the meta-data are integrated into the combined data, the combined data is output in a predetermined format, and/or standalone meta-data is rendered in the same or different format as the predetermined format in operation S440. In aspects of the present invention, the first and second data may be an image, video, audio, sound, text, music, or in other human user or device perceivable formats.

FIG. 5 illustrates a method of embedding meta-data in an image according to an aspect of the present invention. The method of FIG. 5 may be practiced using the apparatus 1000 as shown in FIG. 3. Referring to FIG. 5, an image is obtained, for example, by being input or captured in operation S510. The obtained image may be a raw image in a predetermined data format, or may have been preprocessed for ease of later processing. Contemporaneously, or after the image is obtained, additional data is obtained that is of the obtained image, referred to as associated data, in operation S520. The associated data includes a user input data, environmental data, and/or collected data that is associated with the obtained image.

In aspects of the present invention, the user input data include preferences and various data input by the user. Examples of the user input may be an image, video, audio, sound, and/or text, and may be obtained from, a key pad, a mouse, a touch screen, a touch pad, a scanner, microphone, a digital camera, a video recorder, and/or a trackball, for example. Other examples of the user input may include a 2D barcode representation or optical character recognition (OCR)-recognizable text.

In aspects of the present invention, the environmental data includes data from the environment that is associated with the image. Examples of the environmental data may be temperature, humidity, pressure, GPS location, or time, and may be obtained from thermometers, pressure meter, a GPS locator, or a timepiece, for example.

In aspects of the present invention, collected data include any data for use with the image that is obtained from an outside source. Examples of the collected data may be recall data, for example, and may be obtained from a database query, internet searches, and/or mined databases for data relating to the image.

Referring back to FIG. 5, the image in the predetermined data format, or which has been preprocessed, is analyzed, and meta-data is extracted (or generated) from the image in operation S530. In aspects of the present invention, the extracted meta-data may include information as to face recognition, biometry data extraction, OCR recognition, object recognition, and position/distance measurement, for example, from the image. When the meta-data is extracted from the image, the image may be enhanced to indicate that meta-data has been extracted from the image. For example, a face in the image may be highlighted.

The above noted associated data, the extracted meta-data from the image, and/or the image itself, are consolidated to generate a consolidated meta-data in operation S540. Optionally, supplementary meta-data may further be obtained in operation S540 and may be included to generate the consolidated meta-data. The supplementary meta-data is obtained from an outside source, and may be obtained based on the associated data, the extracted meta-data, and/or the image. In aspects of the present invention, the consolidated meta-data need not be generated from the entire associated data, the extracted meta-data, the image, and/or supplementary meta-data. Rather, the consolidated meta-data may be generated from one or more portions of the associated data, the extracted meta-data, the image, and/or the supplementary meta-data.

Referring back to FIG. 5, a visual representation of the consolidated meta-data is generated by converting the consolidated meta-data into the same format as the image, for example, according to user preference in operation S550. The visual representation of the consolidated meta-data is referred to as a meta-data sub-image, and may be a 2D barcode. Optionally, the consolidated meta-data may be converted into a standalone meta-data representation to be later output in a desired format. Once the consolidated meta-data is generated, the raw image or the enhanced image is merged or embedded with the meta-data sub-image into a combined image according to user preferences in operation S560. Finally, the combined image may be output, or optionally, the image and a corresponding standalone meta-data representation may be output in operation S570. The output combined image and/or the image and a corresponding standalone meta-data representation may be published through use of any storage medium or distribution network to be searchable or searched by information systems or media broadcasting services. Additionally, the output combined image and/or the image and a corresponding standalone meta-data representation may be further output as a broadcast audio and/or video stream to be broadcast for viewing via a display or a television, or output as a web page RSS-feed to be distributed over a community of mobile devices.

FIG. 6A illustrates an image with an embedded meta-data, referred to as a combined image, and 6B illustrates a standalone meta-data representation, which may be affixed to a separately rendered image, according to aspects of the present invention. As shown in FIG. 6A, the combined image 600 according to an aspect of the present invention is composed of a photographic image of a person 601, and an embedded meta-data sub-image 602 that is positioned in the upper right corner of the combined image 600. The person's image 601 is based on the obtained image from a digital camera (used as the visual capture device 1130 as shown in FIG. 3). The meta-data sub-image 602 is one that is generated from a consolidated meta-data using at least portions of the associated data, the extracted meta-data, the image, and/or supplementary meta-data, as discussed above with reference to FIGS. 3 and 5. The embedded meta-data sub-image 602 is shown as a 2D barcode, i.e., as an Aztec code.

As shown in FIG. 6A, the combined image is visually readily perceivable by a human user, programmatically perceivable by a device, or both. For example, when implemented to a passport photograph, the combined image 600 includes passport holder's image 601 as the subject image, and the embedded meta-data sub-image 602 that is associated with the passport holder. The entire combined image 600 is one that a human user can recognize immediately as a photograph having the Aztec code. Further, the human user can obtain information contained in the Aztec code by using an appropriate reader. Additionally, the entire combined image 600 is one the appropriate reader can be used to programmatically perceive both the photograph and the Aztec code.

In the aspect shown in FIG. 6A, the embedded meta-data sub-image 602 is an image representation of the consolidated meta-data, which in turn is generated based on the associated data (including a user input data, the environmental data, and/or the collected data that is associated with the image), the extracted meta-data (based on the image), the image, and/or supplementary meta-data. That is, in the case of FIG. 6A, the consolidated meta-data may include personal identifying information about the passport holder shown in the image 601, such as, name and birthday, as user input data; GPS coordinates of where the image was taken as environmental data; and/or number of previous passports issued as collected data. Further, the consolidated meta-data may include visually identifying information, such as hair color or eye color, as extracted meta data based on the image; and confidential State Department code to insure authenticity of the image as the supplemental meta-data. Accordingly, an image and a corresponding meta-data are rendered together, whereby the corresponding meta-data (the consolidated meta-data) is embedded in the image.

As shown in FIG. 6B, the standalone meta-data representation 650, which may be affixed to a separately rendered image is shown. The standalone meta-data representation 650, when rendered, may be a 2D barcode, such as, an Aztec code, as shown. In aspects of the present invention, the standalone meta-data representation 650 may be embodied as a self-adhering strip, such as a sticker, which may be affixed to a corresponding image, which may appear similar to FIG. 6A. In other aspects, the standalone meta-data representation 650, as a sticker, may be affixed to an item, such as a package, in order to provide information about the package, such as to identify the content and the destination.

As discussed above, aspects of the present invention relate to a method of embedding a meta-data that corresponds to an image into the image, and an apparatus to embed the corresponding meta-data into the image. Accordingly, additional information for an image is effectively collected when the image is captured, the additional information is to be transformed into a usable format as the corresponding meta-data, and embedded into a file of the captured image. The imbedded corresponding meta-data and the captured image may be rendered and perceived by a human user, a device, or both. Further, aspects of the present invention is not limited to an image, and are applicable to video, audio, sound, text, music, or in other human user or device perceivable formats.

In various aspects of the present invention, meta-data is information about the subject data, such as image, video, audio, sound, text, music, or data in other human user or device perceivable formats. Meta-data may document data about elements or attributes (name, size, data type, etc); about records or data structures (length, fields, columns, etc.); and about the data (where it is located, how it is associated, ownership, etc.), of a subject data. Also, meta-data may include descriptive information about the context, quality and condition, or characteristics of the subject data. Accordingly, meta-data is usable to facilitate the understanding, characteristics, and management usage of the subject data.

In aspects of the present invention, the meta-data sub-image is a code, and is neither text nor script.

In various aspects, and/or refers to alternatives chosen from available elements so as to include one or more of the elements. For example, if the elements available include elements X, Y, and/or Z, the and/or refers to X, Y, Z, or any combination thereof.

Although a few aspects of the present invention have been shown and described, it would be appreciated by those skilled in the art that changes may be made in the aspects without departing from the principles and spirit of the invention, the scope of which is defined in the claims and their equivalents. 

1. An apparatus to generate and render data embedded with associated meta-data in a human or machine recognizable format, comprising; an obtaining device to acquire first data in a predetermined format and associated second data comprising information of the first data, and to output the first data and the associated second data; a processing device to receive the first data and the associated second data from the obtaining device, to process the first data and the associated second data to thereby generate meta-data based on the first data and/or the associated second data, to convert the meta-data into the predetermined format of the first data, and to embed the converted meta-data into the first data as a combined data in the predetermined format; and a rendering device to receive the combined data from the processing device, and to render the combined data in the human or machine recognizable format.
 2. The apparatus of claim 1, wherein the processing device further converts the meta-data into a block meta-data of another format.
 3. The apparatus of claim 2, wherein the rendering device further renders the first data in the human or machine recognizable format, and renders the block meta-data as a standalone meta-data in the human or machine recognizable format.
 4. The apparatus of claim 1, wherein the human or machine recognizable format includes a format that can be rendered, published, searched, or broadcast as an image, video, audio, sound, text, music, broadcast video stream, a web page RSS-feed, or a combination thereof.
 5. The apparatus of claim 1, wherein the human or machine recognizable format includes JPEG (Joint Photographic Experts Group), MPEG (Moving Picture Experts Group), PDF (Portable Document Format), and TIFF (Tagged Image File Format).
 6. The apparatus of claim 1, wherein the converted meta-data is a 2D barcode, and the first data is a photographic image.
 7. An apparatus to generate and render an image embedded with associated meta-data, comprising; a data acquisition driver to acquire an image, and associated data comprising information of the image; a data processor to process the acquired image and the associated data from the data acquisition driver in order to generate meta-data based on the image, the associated data, or both, to convert the meta-data into a same format as that of the image, and to embed the converted meta-data into the image; and a rendering driver to receive the image embedded with the converted meta-data from the data processor, and to render the image with the embedded converted meta-data as an output image.
 8. The apparatus of claim 7, wherein the data processor further converts the meta-data into a data block format, and the rendering driver renders the meta-data in the data block format as a standalone meta-data in the same format as that of the image.
 9. The apparatus of claim 7, wherein the converted meta-data is a 2D barcode, and the image embedded with the converted meta-data is a photographic image containing the 2D barcode.
 10. The apparatus of claim 7, wherein the data acquisition driver further comprises: a visual image capture driver to capture the image in a predetermined format, and to output the captured image as the acquire image to the data processor; and a meta-data capture driver to obtain at least one of user input data, environmental data, and collected data for the acquired image, and to output to the data processor the obtained at least one of user input data, environmental data, and collected data as associated data comprising information of the acquired image.
 11. The apparatus of claim 7, wherein the data processor further comprises: a visual meta-data extractor to analyze the image from the data acquisition driver and extract one or more extracted meta-data that characterizes the image, and to output the image or an enhanced image according to a user preference, wherein the enhanced image includes one or more highlighted characteristics of the enhanced image that have been extracted into meta-data; a smart meta-data miner to connect to remote information sources and to obtain supplementary meta-data that are additional information of the image from the remote information sources; a meta-data processor to extract additional meta-data from the associated data, the image, and/or the enhanced image, and to generate consolidated meta-data from the extracted meta-data, the additional meta-data, and/or the supplementary meta-data; a meta-data to raw-image formatter to convert the consolidated meta-data from the meta-data processor into a meta-data sub-image, which is a visual representation of the consolidate meta-data; a raw-image merger to combine the image or the enhanced image from the visual meta-data extractor with the meta-data sub-image from the meta-data to raw-image formatter into a combined image; and a meta-data to data-block formatter to selectively format the consolidated meta-data from the meta-data processor into a standalone meta-data representation that is user or machine recognizable, according to user preference.
 12. The apparatus of claim 11, wherein the meta-data processor further provides the consolidated meta-data to the visual meta-data extractor.
 13. The apparatus of claim 11, wherein the meta-data processor provides the extracted meta-data and/or the additional meta-data to the smart meta-data miner so that the smart meta-data miner obtains the supplementary meta-data of the image from the remote information sources based on information provided by the image, the extracted meta-data, and/or the additional meta-data.
 14. The apparatus of claim 11, wherein the rendering driver further comprises: an image renderer to receive the combined image from the raw-image merger, and to output the rendered combined image as a photographic image having the meta-data sub-image visually rendered and overlying a predetermined portion of the photographic image; and a data renderer to receive the standalone meta-data representation, and to selectively output a visually rendered standalone meta-data representation.
 15. The apparatus of claim 14, wherein the image renderer further receives the image and/or the enhanced image, and selectively renders the image and/or the enhanced image as a photographic image, and the data renderer renders the standalone meta-data representation visually to be physically attached to the rendered image or enhanced image.
 16. The apparatus of claim 7, wherein the image is in a Joint Photographic Experts Group (JPEG) format, and the converted meta-data is a 2D barcode.
 17. A method of generating and rendering data embedded with associated meta-data in a human or machine recognizable format, comprising obtaining first data in a predetermined format, and second data that is associated with the first data; processing the obtained first data and the associated second data to generate meta-data based on the first data, and/or the associated second data, converting the meta-data into the predetermined format of the first data; embedding the converted meta-data into the first data to obtain a combined data; and rendering the combined data in the predetermined format of the first data, the predetermined format of the first data being the human or machine recognizable format.
 18. The method of claim 17, wherein the human or machine recognizable format includes a format that can be rendered, published, searched, or broadcast as an image, video, audio, sound, text, music, broadcast video stream, a web page RSS-feed, or a combination thereof.
 19. The method of claim 17, wherein the first data is a photographic image and the converted meta-data is a 2D barcode positioned in a predetermined position of the photographic image.
 20. A method of embedding meta-data in an image to be rendered together, comprising: obtaining the image; obtaining associated data contemporaneously with the image, the associate data being additional information of the image and includes a user input data, environmental data, and/or collected data that is associated with the image; extracting meta-data from the image that characterizes the image according to user selection as an extracted meta-data; generating a consolidated meta-data by consolidating the associated data and the extracted meta-data; generating a meta-data sub-image, which is a visual representation of the consolidated meta-data, by converting the consolidated meta-data into the same format as the image; embedding the meta-data sub-image into the image to generate a combined image in the same format as the image; and visually rendering the combined image on a medium or a display device. 